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Because of chemical interconversion, many 
chemical systems cannot be physically separated, 
making chemical identification and quantification 
difficult. The spectra (IR, UV, Visible, Raman, 
CD, etc.) of such systems exhibit overlapping con- 
tributions of uncataloged components, confound- 
ing the identification as well as the quantification. 
Strategies based on factor analysis [1], a chemomet- 
ric technique for handling complex multi-dimen- 
sional problems, are ideally suited to such 
problems. Abstract factor analysis (AFA) reveals 
the number of spectroscopically visible compo- 
nents. Evolutionary factor analysis (EFA) [2-4] 
takes advantage of experimental variables that con- 
trol the evolution of components, revealing not 
only the concentration profiles of the components 
but also their spectra even when there are no 
unique concentrations or spectral regions. 

Evolutionary factor analysis makes use of the 
fact that each species has a single, unique maximum 
in its evolutionary concentration distribution 
curve. We have recently applied this self-modeling 
method to the infrared spectra of stearyl alcohol in 
carbon tetrachloride solution. The evolutionary 
process of this system was achieved by increasing 
the concentration of stearyl alcohol from 0.0090 to 
0.0800 g/L in 15 stages, each time recording the IR 
spectra from 3206 to 3826 cm '. The spectra were 
corrected for baseline shift, solvent absorption and 
reflectance losses. The 15 spectra were then digi- 
tized every 3 cm ' and assembled into a 35 X 15 ab- 
sorbance matrix [A] appropriate for factor analysis. 

The factor indicator function [1], the reduced ei- 
genvalue [5] and cross validation [6] indicated that 
three species contribute to the observed spectra. 
Thus AFA expresses the data matrix as a product 
of a 35x3 absorpfivity matrix [£]abM and 3x15 ab- 
stract concentration matrix [CJ^hsi. 

[A] = [EUACU 



Because the abstract matrices are mathematical so- 
lutions devoid of chemical meaning, they must be 
transformed into physically meaningful absorptivi- 
ties and concentrations. This is accomplished by 
target transformation factor analysis (TFA) [1], a 
powerful technique which allows one to test fac- 
tors individually without requiring any a priori in- 
formation concerning the other factors. A test 
vector C|eM emulating an evolutionary profile can 
be converted into a predicted vector Cp^„j that lies 
completely inside the factor space by finding a 
transformation vector T that minimizes the sum of 
squares of the difference between C|,.s, and Cp^^.j. 
Accordingly, C^„.^ is given by; 

in which 7" = C,„, [C]V, {[C],HM [C]',,J-' 

The prime indicates matrix transposition. 

These equations were used to target test 15 ideal 
(Dirac delta function) concentration profiles, rep- 
resented by uniqueness tests for each column of the 
absorbance matrix. A uniqueness vector consists of 
zeros for all elements except the one in question, 
which contains unity: 

C, = ( 1,0,0,. ,.,0,0,0) 
C; = (0, 1,0,. ..,0,0,0) 



C„ = (0,0,0,. ...0,0,1) 

The three predicted vectors with maxima corre- 
sponding to the unique point of the respective test 
vector were retained as likely candidates. The.se 
crude profiles were refined, individually, by apply- 
ing simplex optimization to the respective transfor- 
mation vector, using a response function designed 
to minimize negative concentration points and dou- 
ble maxima in the profile. 

Further refinement was achieved by the follow- 
ing iteration. Because negative regions are meaning- 
less, all data beyond the boundaries marked by the 
first negative regions encountered on the lefi and 
on the right of the peak maximum were truncated. 
These profiles were normalized so the sum of 
squares equals unity and then assembled into a con- 
centration matrix [C]. The pseudoinverse equation 

[E]^{A\[CY {[C][C]-} ' 

was used to calculate the spectral matri,\, followed 
by another pseudoinverse 
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[C] = i[E]'[E]y'[E][A] 

to recalculate the concentration profiles. This pro- 
cess (truncation, normalization and pseudoinverse 
followed by pseudoinverse) was repeated until no 
further refinement occurred. 

The concentration profiles and spectra of the 
three unknown components of stearyl alcohol in 
carbon tetrachloride obtained in this manner were 
found to make chemical sense. 

This EFA procedure, unlike others, was success- 
ful in extracting concentration profiles from situa- 
tions where one component profile was completely 
encompassed underneath another component 
profile. 
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tions to the field. Obviously, this is not the place to 
offer a review on chemometrics, let alone one that 
is restricted to a continent. 

The definition of chemometrics [1] comprises 
three distinct areas characterized by the key words 
"optimal measurements," "maximum chemical in- 
formation" and, for analytical chemistry something 
that sounds like the synopsis of the other two: "op- 
timal way [to obtain] relevant information," 



Information Theory 

Eckschlager and Stepanek [2-5] pioneered the 
adaption and application of information theory in 
analytical chemistry. One of their important results 
gives the information gain of a quantitative deter- 
mination [5] 
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where q and p are the prior and posterior distribu- 
tion of the analyte concentration for the specific 
cases of a rectangular prior distribution in {xi,X2) 
and a Gaussian posterior with a standard deviation 
J determined from n^ independent results. The 
penalty for an inaccurate analysis is considerable 
and can be expressed as 
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Chemometrics is a very international branch of 
science, perhaps more so than chemistry at large, 
and it is therefore appropriate to question the suit- 
ability of the topic to be presented. It is, however, 
the author's opinion that the profile of European 
chemometric research has a couple of distinct fea- 
tures that may originate more in the structure of 
the educational system than in the actual research 
topics. The profile as it will be presented is the one 
perceived by the author, and therefore comprises a 
very subjective selection of individual contribu- 



with d the difference between obtained value and 
the true value of j:. The concept has also been ex- 
tended to multicomponent analysis and multi- 
method characterization. In the latter case, 
correlations between the information provided by 
the different methods need to be accounted for. 
Given the cost of and time needed for an analysis, 
information efficiency can be deduced in a straight- 
forward manner [2]. Recently, work was published 
[5] suggesting the incorporation of various rele- 
vance coefficients; this, indeed, is a very important 
step since it provides a way to single out the infor- 
mation that is judged to be relevant for a given 
problem. It also opens up the possibility to draw on 
information theory for defining objective functions 
in computer-aided optimization of laboratory pro- 
cedures and instruments. 
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